Broadcast switch system in a network-on-chip (NoC)

ABSTRACT

A system and methods of use for a broadcast switch system, broadcast management switching system, and methods of use in network-on-chip are presented. The invention relates generally to broadcasting transactions in a network-on-chip (NoC). More specifically, and without limitation, the invention provides for transacting from master to multiple slaves and for receiving responses. The invention relates to a broadcast switch for broadcasting transactions. More specifically, and without limitation, the invention relates to a broadcast switch system, broadcast management switching system, and methods of use in NoC.

FIELD OF THE INVENTION

The invention relates generally to broadcasting transactions in anetwork-on-chip (NoC) and, more specifically, to a broadcast switch forbroadcasting transactions in a broadcast management switching system.

BACKGROUND

The system creation, design, and development of computer processorsincludes multiprocessor systems. Multiprocessor systems have beenimplemented in systems-on-chips (SoCs) that communicate and interactwith network-on-chips (NoCs). The SoCs, generally, include instances ofmaster (initiators) intellectual property (IPs) and slave (targets) IPs.In some instances, one master sends a transaction or request to multipleslaves. The transactions are sent using industry standard protocols,such as ARM, AMBA, AXI, AHB, or APB; or OCP-IP. The protocols have astrict request/response semantic, and typically are treated by an NoC asunicast; the master, connected to the NoC, sends a request to a slave,using an address to select the slave. The NoC decodes the address andtransports the request from the master to the slave. The slave handlesthe transaction and sends a response, which is transported back by theNoC to the master.

The modern, and known, application utilized when a master needs to sendthe same transaction and/or request to multiple slaves, is for themaster to send all the requests sequentially. The master sends thetransaction to the first slave, then to the second slave, then to thethird slave, and so on. For example, if a master wants to write the samedata into 16 different slaves, the master sends 16 identical writetransactions, in sequence, with one going to each slave. Thus, the timetaken by the total operation, the time spent to write one transaction ismultiplied by 16, is 16 times the amount of writing a singletransaction. This limits the rate at which an identical request can besent to multiple slaves. In other words, the rate is limited by the rateat which a master can send sequential requests to all the destinations,i.e. the slaves individually. Thus, there is a need for a system andmethod of use that saves time and can potentially save wires andprocessing, and the like.

Furthermore, in addition to the number of transactions which occur inthe circumstances described above, the method of broadcasting, or themethod of sending a transaction from a master to a slave, also requiresa switch. In sending master/slave transactions in a network or NoC,switches are used as the gatekeepers for determining which transactionsare sent through which ports. In this way, and as is commonly used, theingress port and the egress port are connected to a switch so that apacket entering the switch has to be decoded, routed, and encoded beforeexiting the switch. In this way, in modern and known, applicationsutilize the master, and all transactions being sent to a slave must bewritten individually, transacted individually through a switch. Thisrequires a great deal of processing and limits capabilities and ratesthat a master can send sequential requests to all the destinations basedon both writing processing and switch availability and processing.Therefore, there is a need in the art for a system and method of usethat saves time, transacting power, and enhances transaction efficiency,and the like.

SUMMARY

The invention relates generally to broadcasting transactions in anetwork-on-chip (NoC). More specifically, and without limitation, theinvention relates to a broadcast switch for broadcasting transactions.More specifically, and without limitation, the invention relates to abroadcast switch system, broadcast management switching system, andmethods of use in NoC.

The invention provides new systems and methods for broadcastingtransactions in a NoC. More specifically, the invention provides the artwith a broadcast switch. In other words, the invention provides thestate of the art with a new building block for a NoC.

In the arrangements shown, and in this invention, a broadcast switch ispresented which serves as a broadcast adapter that also performs therouting functions of a switch. These routing functions, in the inventionare performed simultaneously with the transaction broadcast processing,in what results as a much more efficient means of broadcastingtransactions and/or sending transactions, particularly with respect tothe master/slave relationships that exist in transactions.

Said another way, the invention provides a broadcast adapter to the artwhich also has switching functionalities. In other words, the mechanismprovided can perform transaction broadcasting while also performingswitch functions. The broadcast switch, disclosed herein, combines thefunctionalities of a broadcast adapter and a switch into one unit, whichperforms transaction broadcast and switching simultaneously. Thiseliminates many of the transactions that occur in modern processes,especially those related to crossing physical boundaries of an adapterand a switch.

In other words, the systems and methods presented herein simplify thenetwork on chip because of fewer linked, fewer instantiations, and fewerwiring of building blocks, among other advantages. In the arrangementshown herein, as one example, the broadcast switch reduces the number oflinks required in a broadcast adapter with a switch. Furthermore, thebroadcast switch implements route table lookup and port table lookup,and as a result the broadcast switch can broadcast transactions tomultiple destinations via the same egress port. For this reason, amongothers, this new building block and its corresponding functionalities,dramatically increase the capabilities of the egress ports while alsorequiring fewer egress ports for assembly of the overall system.

Systems that embody the invention, in accordance with the aspectsthereof, are typically designed by describing their functions inhardware description languages. Therefore, the invention is alsoembodied in such hardware descriptions, and methods of describingsystems as such hardware descriptions, but the scope of the invention isnot limited thereby.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to understand the invention more fully, reference is made tothe accompanying drawings or figures. The invention is described inaccordance with the aspects and embodiments in the following descriptionwith reference to the drawings or figures (FIG.), in which like numbersrepresent the same or similar elements. Understanding that thesedrawings are not to be considered limitations in the scope of theinvention, the presently described aspects and the presently understoodbest mode of the invention are described with additional detail throughuse of the accompanying drawings.

FIG. 1 illustrates a flow process for broadcasting a request from amaster to multiple slaves using a broadcast switch in accordance withthe aspects and embodiments of the invention.

FIG. 2 illustrates a flow process for broadcasting a request from amaster to multiple slaves using a broadcast switch in accordance withthe aspects and embodiments of the invention.

FIG. 3 illustrates a flow process for broadcasting a request from amaster to multiple slaves using a broadcast switch in accordance withthe aspects and embodiments of the invention.

FIG. 4 illustrates a network-on-chip with a master and multiple slavesin accordance with the aspects and embodiments of the invention.

FIG. 5 illustrates a NoC with a master and multiple slaves in accordancewith the aspects and embodiments of the invention.

FIG. 6 illustrates a NoC with a master and multiple slaves, which alsoshows integration of a plurality of writers, in accordance with theaspects and embodiments of the invention.

FIG. 7 illustrates an embodiment of a broadcast switch showing aningress and at least one egress in accordance with the aspects andembodiments of the invention.

FIG. 8 illustrates an embodiment of a plurality of broadcast switchesarranged, such that stringing together multiple broadcast switches mayenhance the functionality, in accordance with the aspects andembodiments of the invention.

FIG. 9 illustrates an embodiment of a plurality of broadcast switchesarranged such that stringing together multiple broadcast switches mayenhance the functionality of an embodiment as is shown in the invention.

FIG. 10 illustrates an embodiment of a broadcast switch showing aningress and at least one egress in accordance with the aspects andembodiments of the invention.

FIG. 11 illustrates an embodiment of a broadcast switch showing aningress and at least one egress in accordance with the aspects andembodiments of the invention.

FIG. 12 illustrates an embodiment of a system, which shows a broadcastswitch with an integrated transform function, in accordance with theaspects and embodiments of the invention.

FIG. 13 illustrates an embodiment of a system, which includes abroadcast switch with at least one buffer integrated into the broadcastswitch, in accordance with the aspects and embodiments of the invention.

FIG. 14 illustrates an embodiment of the system and a broadcast switchchannel range and special channel for various broadcast switch addressesand target addresses in accordance with the aspects and embodiments ofthe invention.

DETAILED DESCRIPTION

In the following detailed description, reference is made to theaccompanying drawings which form a part hereof, and in which is shown byway of illustration specific embodiments in which the invention may bepracticed. These embodiments are described in sufficient detail toenable those skilled in the art to practice the invention, and it is tobe understood that other embodiments may be utilized and thatmechanical, procedural, and other changes may be made without departingfrom the spirit and scope of the invention(s). The following detaileddescription is, therefore, not to be taken in a limiting sense, and thescope of the invention(s) is defined only by the appended claims, alongwith the full scope of equivalents to which such claims are entitled.

As used herein, the terminology such as vertical, horizontal, top,bottom, front, back, end, sides and the like are referenced according tothe views, pieces and figures presented. It should be understood,however, that the terms are used only for purposes of description, andare not intended to be used as limitations. Accordingly, orientation ofan object or a combination of objects may change without departing fromthe scope of the invention.

Reference throughout this specification to “one embodiment,” “anembodiment,” “one example,” or “an example” means that a particularfeature, structure, or characteristic described in connection with theembodiment or example is included in at least one embodiment of theinvention. Thus, the appearance of the phrases “in one embodiment,” “inan embodiment,” “one example,” or “an example” in various placesthroughout this specification are not necessarily all referring to thesame embodiment or example. Furthermore, the particular features,structures, databases, or characteristics may be combined in anysuitable combinations and/or sub-combinations in one or more embodimentsor examples. In addition, it should be appreciated that the figuresprovided herewith are for explanation purposes to persons ordinarilyskilled in the art and that the drawings are not necessarily drawn toscale.

Embodiments in accordance with the invention may be embodied as anapparatus, method, or computer program product. Accordingly, theinvention may take the form of an entirely hardware-comprisedembodiment, an entirely software-comprised embodiment (includingfirmware, resident software, micro-code, etc.), or an embodimentcombining software and hardware aspects that may all generally bereferred to herein as a “circuit,” “module,” or “system.” Furthermore,embodiments of the invention may take the form of a computer programproduct embodied in any tangible medium.

Any combination of one or more computer-usable or computer-readablemedia may be utilized. For example, a computer-readable medium mayinclude one or more of a portable computer diskette, a hard disk, arandom access memory (RAM) device, a read-only memory (ROM) device, anerasable programmable read-only memory (EPROM or Flash memory) device, aportable compact disc read-only memory (CDROM), an optical storagedevice, and a magnetic storage device. Computer program code forcarrying out operations of the invention may be written in anycombination of one or more programming languages. Such code may becompiled from source code to computer-readable assembly language ormachine code suitable for the device or computer on which the code willbe executed.

Embodiments may also be implemented in cloud computing environments. Inthis description and the following claims, “cloud computing” may bedefined as a model for enabling ubiquitous, convenient, on-demandnetwork access to a shared pool of configurable computing resources(e.g., networks, servers, storage, applications, and services) that canbe rapidly provisioned via virtualization and released with minimalmanagement effort or service provider interaction and then scaledaccordingly. A cloud model can be composed of various characteristics(e.g., on-demand self-service, broad network access, resource pooling,rapid elasticity, and measured service), service models (e.g., Softwareas a Service (“Saas”), Platform as a Service (“PaaS”), andInfrastructure as a Service (“IaaS”)), and deployment models (e.g.,private cloud, community cloud, public cloud, and hybrid cloud).

As used herein, a “source,” a “master,” and an “initiator” refer tosimilar intellectual property (IP) blocks, modules, or units; the termsare used interchangeably within the scope and embodiments of theinvention. As used herein, a “sink,” a “slave,” and a “target” refer tosimilar IP modules or units and the terms are used interchangeablywithin the scope and embodiments of the invention. As used herein, atransaction may be a request transaction or a response transaction.Examples of request transactions include write request and read request.

The flowchart and block diagrams in the attached figures illustrate thearchitecture, functionality, and operation of possible implementationsof systems, methods, and computer program products according to variousembodiments of the invention. In this regard, each block in theflowchart or block diagrams may represent a module, segment, or portionof code, which comprises one or more executable instructions forimplementing the specified logical function(s). It will also be notedthat each block of the block diagrams and/or flowchart illustrations,and combinations of blocks in the block diagrams and/or flowchartillustrations, may be implemented by special purpose hardware-basedsystems that perform the specified functions or acts, or combinations ofspecial purpose hardware and computer instructions. These computerprogram instructions may also be stored in a computer-readable mediumthat can direct a computer or other programmable data processingapparatus to function in a particular manner, such that the instructionsstored in the computer-readable medium produce an article of manufactureincluding instruction means which implement the function/act specifiedin the flowchart and/or block diagram block or blocks.

All illustrations of the drawings are for the purpose of describingselected versions of the invention and are not intended to limit thescope of the invention.

The following descriptions are in reference to FIG. 1 through FIG. 14 .The invention relates to broadcasting transactions in a network-on-chip(NoC). More specifically, and without limitation, the invention relatesto a broadcast switch for broadcasting transactions. More specifically,and without limitation, the invention relates to a broadcast switchsystem, broadcast management switching system, and methods of use inNoC.

In accordance with some aspects and embodiment of the invention, acomponent for NoC is disclosed, which is a broadcast capable switch orbroadcast switch 20. The broadcast switch 20 implements a new approachto sending a transaction from a master to multiple slaves and/or frommultiple masters to multiple slaves. According to the variousembodiments and aspects of the invention depicted herein, a specialrange of addresses can be used. The NoC broadcasts a transactionreceived at one or more of the special addresses, which is within aspecial range of addresses. The broadcasted transactions can reachmultiple destinations and/or slaves simultaneously instead of sending abroadcasted transaction to a single destination, while also providingswitching functionality, and more.

One of many advantages is the efficiency of operation by having abroadcast adapter functionality and switching functionality built into asingle, more efficient, handler of transactions. Another advantage,among many, as disclosed herein, is efficiency in the overall operationof broadcast networks and transaction handling that includes sending thesame transaction to multiple destinations. Yet another advantage, amongmany, as disclosed herein, includes the ability to perform functions ona transaction prior to broadcasting the transaction. And anotheradvantage, among many, as disclosed herein, is efficiency in the abilityto perform switching functions on a transaction prior to broadcastingthe transaction.

In accordance with the various aspects and embodiments of the invention,the system provides a new component that can be used for NoC,particularly a new building block in broadcasting transactionfunctionality. In accordance with the various aspects and embodiments ofthe invention, the broadcast switch disclosed herein is a broadcastadapter and performs routing function of a switch when performing atransaction broadcast. In accordance with the various aspects andembodiments of the invention, the broadcast switch combines thefunctionalities of a broadcast adapter and a switch into one buildingblock. The broadcast switch performs transaction broadcast functionalityand switching functionality simultaneously. In this way, this is thefirst building block component provided for NoC that performs thesefunctionalities simultaneously.

Furthermore, the broadcast switch, disclosed herein, eliminates a numberof transaction overheads that incur when crossing the physicalboundaries of an adapter and a switch. These physical boundaries mayinclude wires, processing, and increased sizing of NoC design. In thisway, the invention eliminates some of the wires, reduces processingdelays, and decreases the size of NoC design. These features, and othersare achieved through use and implementation of the new building blockdisclosed herein.

In accordance with the various aspects and embodiments of the invention,the elimination of many transaction overheads that occur as a result ofuse of the broadcast switch simplifies the NoC because fewer links andfewer instantiations and wirings of building blocks are needed. Thebroadcast switch disclosed herein reduces, by up to half or at leasthalf, the number of links required in design using a broadcast adapterwith a switch. In accordance with some aspects and embodiments of theinvention, the broadcast switch can retrieve or lookup route informationand port information (for an intended target or slave) from lookuptables based on the rote field of an incoming transaction. In accordancewith some aspects and embodiments of the invention, the broadcast switchincludes a route computation module to determine route information for atransaction using the route field of the incoming transaction. Thus, thebroadcast switch implements route table lookup. The broadcast switch canbroadcast transactions to multiple destinations via the various routes,as selected from the route table, by modifying the incomingtransaction's route field with the updated routing information. Inaccordance with some aspects and embodiments of the invention, theincoming transaction is updated, modified, or changes by a modify moduleof the broadcast switch.

Additionally, because the broadcast switch implements egress tablelookup, the broadcast switch can broadcast transactions to multipledestinations via the same egress port. This functionality, among otherfunctionalities and advantages can be useful, and provide enhancedefficiencies, when the number of egress ports are less than the numberof destinations, and in many other circumstances, as disclosed hereinand contemplated for use.

In accordance with some aspects and embodiment of the invention, thebroadcast switch is a multicast switch that sends transactions multipletargets, on the request side, using one egress port. As used herein, theterm “transaction” includes and is used interchangeably with: “incomingtransactions,” “arriving transactions,” “departing transactions,”“outgoing transactions,” “data,” “data packets,” and “packets.”

In accordance with aspects and embodiments of the invention, thebroadcast switch also includes buffering sufficient to hold the entirepacket because the broadcast switch sends out (broadcasts) transactionsor packets serially, especially when the broadcast switch has feweregress ports than the number to destination, to which it isbroadcasting. For example, when the broadcast switch is broadcasting totwo or more destination using a one egress port. This approach has theadvantage of allowing the source (initiator) to send one request to thebroadcast switch and avoid having to individually create the separaterequests.

Although the invention may be explained in relation to examples ofembodiments, it is to be understood that many other possiblemodifications and variations can be made without departing from thespirit and scope of the invention.

In accordance with the various aspects and embodiments of the invention,a broadcast management system 10 is shown (hereafter known as “broadcastmanagement system”, “broadcast capable switch system”, or “broadcastsystem” or simply “system”). Broadcast management system 10 is formed ofany suitable size, shape and design and the scope of the invention isnot limited thereby. In the arrangement shown, as one example, system 10is part of a NoC, shown in accordance with an embodiment of theinvention and may also include remote servers, databases, applicationservers, application databases, application programming interfaces, userdatabases, event databases, transmission rule databases, rulesdatabases, data transmission components and/or features, mobileapplications, and/or computers, cellular transmission, data packets, andthe like, that fulfill the functions disclosed herein.

Previous NoC technology required, at best, a plurality of broadcastadapters and a plurality of switches. In these arrangements, theconnection, from end-to-end (and in this case master to slave) must gothrough a switch. The switch computes the route and modifies the routefield in the packet and knows which port is destined for a packet. Assuch, the ingress port and the egress ports of a broadcast adapter mustbe connected to switches. Therefore, a packet traveling through thesystem, from end to end (or master to slave/target), has to be decoded,routed, and encoded at each switch level in the system. So at a minimum,this decoding, routing, and encoding must occur twice before a datapacket can reach the destination.

The invention, in accordance with various aspects and embodiments,provides these functionalities, systems and solutions in the followinggeneral manner. With reference to FIG. 4 as disclosed herein, abroadcast switch (0) 20 connects other broadcast switches (1 and 2) 20to a master 12 using network interface unit (NIU) 14. There is also aNIU 14 between the broadcast switch (1) 20 and a target 22. Thebroadcast switch can perform all the functionality for both an adapterand various switches. Further, the target 22 is in communication with(can receive transactions from) the broadcast switch 20 (as shown) aswell as other sources through other network elements (not shown forclarity). As such, there are routes that connect a target to thebroadcast switches for receiving multi-cast transactions and there areroutes (not shown) connected to the target for receiving normaltransactions.

The master 12 sends transaction to the broadcast switch, such asbroadcast switch 20. The transactions include information that tells thebroadcast switch if the transaction is a multicast transaction or anormal transaction (i.e. intended for just one target or destination).In accordance with some aspects and embodiments of the invention, thebroadcast switch acts a normal switch based on the information in theincoming packet (transaction) that identifies the incoming packets as anormal transaction. In accordance with some embodiments of theinvention, the broadcast switch connects directly to any IP block. Inthis way, the broadcast switch can compute the route and modify theroute field in the packet and/or data packet. Additionally, thebroadcast switch is capable of determining routes to a destination.Thus, the broadcast switch includes the ability to mask out any one ormore of the targets, to which it is capable of broadcasting, based oninformation contained in transaction (the incoming request).

In accordance with some aspect and embodiment of the invention, thebroadcast switches that have more than one egress port can select any ofone or more of the egress ports, from among all the egress ports, thatwill receive a broadcast of the packet and/or data packet. In accordancewith some aspect and embodiment of the invention, the broadcast switchesthat have one egress port can select any of one or more of thedestinations (targets), from among all the possible targets to which itcan broadcast using the egress port, that will receive a broadcast ofthe packet and/or data packet. Masking is based on information provideda request (broadcast packet) that is received at a broadcast switch. Therequest includes information about masking. The information is used bythe broadcast switch to mask out (prevent broadcasting to) specifictarget(s) from a fixed set of targets, which are associated with abroadcast tree for that broadcast switch.

In the arrangement shown, system 10 provides these advantages and morein the following way. When a packet and/or data packet is broadcast fromthe master 12, the broadcast switch looks up a target path route tableusing the target identification in a request packet to retrieve theroute to the target from the broadcast switch. In this way, the routefor the packet is put in the request packet going out to the target(also known as parallel lookup function, which can operate independentlyof transmission order).

Following the lookup, or simultaneously and in parallel, of the targetpath route table, the broadcast switch 20 determines and/or looks up thetarget path port table using the target identification. This is alsodone in a request packet to retrieve the egress port number of thebroadcast switch 20. In this way, the request packet going out to thetarget is sent to the egress port as indicated in the targetidentification of the target path port table. One advantage is that thisfeature allows the request packets going to multiple targets to be sentvia the same egress port.

As another arrangement, and disclosed herein, the target path routetable and the target path port table can be set up such that thebroadcast switch can broadcast the packet and/or transaction to multipledestinations via the same egress port. This novel feature and novelcapability is advantageous for many reasons. One reason this novelfeature is advantageous is because in some situations, the number ofegress ports of the broadcast switch are less than the number ofdestinations. In this way, the utilization of and efficiency of anegress port greatly appreciates due to the organizationalcharacteristics of the broadcast switch.

Following the lookup, or simultaneously and in parallel, of the targetpath route table and the target path port table, the broadcast switchdetermines and/or looks up the source path route table using the sourceidentification in the response packet. The broadcast switch uses thesource path route table and source identification to determine the routeto the source from the broadcast switch. In this way, the route is putin the response packet going out to the source.

Once the broadcast switch determines the target path route table, thetarget path port table, and the source path route table, the broadcastswitch performs the routing function of a switch for the request pathand the response path.

In the arrangement shown, as one example, system 10 performs thesefunctionalities as described in the methods herein, in a novel way, andvarious components, functionalities, and features to carry out thesefunctions. Some of these components (as are further described herein)may include a master 12, a NIU 14, a queue, a transaction, a writer 18,a broadcast switch 20, a slave 22 (or “target”), an ingress port 26, anegress port 28, a broadcast capable switch (“BCS”) address range 30, aBCS address 32, a slave address range 34 (or “target address range”), aslave address 36 (or “target address”), a buffer 40, a target path routetable, a target path port table, a source path route table, and othervarious features, components and functionalities. Furthermore, system 10includes various pipeline elements, in accordance with variousembodiments of the invention, some of which are shown and some of whichare not shown in the arrangements herein.

In accordance with the various aspects and embodiments of the invention,as one example, system 10 includes master 12. Master 12 (also known as“initiator”, a “source,” or a “sender”) is formed of any suitable size,shape, and design and is configured to create and/or send packets and/ordata packets and/or send data transmissions. In the arrangement shown,as one example, an initiator may be a computer, CPU, or server, or thelike, which refers to an internet protocol module or unit. The primarypurpose of the master 12, as disclosed herein, is to create and/or sendthe data packet and potentially provide other information related to thetransmission being broadcast.

In the arrangement shown, as one example, the master 12 is incommunication with the NIU 14 (to be further discussed herein).Subsequently, the NIU 14 is connected to a plurality of slaves (or“targets”) 22 (to be further discussed herein). In this way, the master12 is enabled to send transactions to multiple slaves 22.

While a single master is discussed herein, as one example, it will beapparent that many other embodiments are contemplated, which includemultiple masters and multiple slaves, even though only one master andseveral slaves are shown for clarity in this example; for ease ofexplanation.

In accordance with the various aspects and embodiments of the invention,one master may send a transaction or request to multiple slaves 22.Furthermore, a master 12 may send a request to a slave 22, using anaddress to select the slave 22. Furthermore, the master 12 may write thesame data to multiple different slaves 22.

In accordance with the various aspects and embodiments of the invention,in the arrangement shown as an example, system 10 includes a pluralityof slaves 22. Slaves 22 (or “targets”) are formed of any suitable size,shape, and design and are configured as target addresses and/or targetinternet protocols which are set up to receive transactions (to befurther discussed herein) from a master via the pipeline discussedherein.

In accordance with the various aspects and embodiments of the invention,the plurality of slaves 22 are, each or alone, in communication with theNIU 14 (to be further discussed herein). Subsequently, the NIU 14 isconnected to a plurality of masters 12. In this way, the target 22 isenabled to receive transactions from one master or multiple masters 12.

While a single target 22 is discussed on occasion herein, as oneexample, it will be apparent that many other embodiments arecontemplated, such as two targets, three targets, four targets, anyother number of targets and multiple masters, even though only onemaster and one target might be shown for clarity in this example; forease of explanation. Said another way, one master may send a transactionor request to multiple slaves 22. Furthermore, a master 12 may send arequest to a slave 22, using an address to select the slave 22.Furthermore, the master 12 may write the same data to multiple differentslaves 22. In the arrangement shown, as one example, system 10 includesa plurality of transactions. Transactions are formed of any suitablesize, shape, and design, and is a term used to describe data that isbeing broadcast by the master 12 and/or data that is being requested bythe master 12.

In the arrangement shown, as one example, one or a plurality of masters12 may send a transaction and/or request a transaction to or frommultiple slaves 22. Furthermore, a master 12 may send a transactionrequest to a slave 22, using an address to select the slave 22.Furthermore, the master 12 may write the same data to multiple differentslaves 22.

In the arrangement shown, as one example, arriving and departingtransactions may come from various addresses, sources, with varyingpriorities. Additionally, and in the arrangement shown, arriving anddeparting transactions may come from AXI, ACE lite, CHI, and may carry aquality of service field and various other identifiers which can be usedto determine the requirements and/or goals.

In accordance with the various aspects and embodiments of the invention,system 10 includes a network interface unit (NIU) 14 (or “networkinterface”). NIU 14 is formed of any suitable size, shape, and designand is configured as the point of interconnection between a master 12(or simply a computer) and a private or public network. In accordancewith the various aspects and embodiments of the invention, a networkinterface may be a network interface card and need not have a physicalform. In this way, an implemented software is also hereby considered asa NIU 14.

Additionally, and in the arrangement shown, system 10 may also includenetwork interface units 14 located on the slave and/or target 22 side.In this way, a plurality of network interface units 14 may be connectedto slaves 22 and are used to convert the protocols used inside the NoCto the protocols used by the targets 22. In this way, the networkinterface units 14 enable the master 12 to communicate through thesystem 10 with a plurality of slaves 22.

Said another way, the network interface units 14 translate the incomingtransactions, formed by the master 12, to the protocol used inside theNoC, for transport. In the arrangement shown, as one example, the NIU 14is in communication with the master 12, with the broadcast switch 20 andpotentially other various pipeline elements, in accordance with variousembodiments of the invention, some of which are shown and some of whichare not shown.

In accordance with the various aspects and embodiments of the invention,the system 10 includes a broadcast switch 20. Broadcast capable switch20 (also known as “broadcast switch”) is formed of any suitable size,shape, and design and is configured to receive a transaction (and/or apacket representing a request and/or request transaction). The broadcastswitch 20 is configured to duplicate the packet, send the packetduplicates to each request egress port (to be further discussed herein).Broadcast switch 20 may handle a request packet of data, whichrepresents a transaction, arriving at an ingress port (to be furtherdiscussed herein). The packet is duplicated by the broadcast switch, asneeded, and depending on the request type and the duplicates are sent aspackets through various egress ports.

In accordance with the various aspects and embodiments of the invention,the broadcast switch 20 may handle a packet which arrives at an ingressport by duplicating, then receiving a communication through a networkinterface to a slave 22, and subsequently sending additionalcommunications through a second or third network interface which issimilar along a second or third path.

In this way, the broadcast switch 20 handles broadcast adapterfunctionality as well as broadcast switch functionality within a singlebuilding block. These and other methods are hereby contemplated for use.The example discussed above is one of many processes and or transactionsthat can be handled by a broadcast switch 20.

For another example, a broadcast switch 20 can connect directly to othernetwork interfaces, other broadcast adapter, and/or other switchesbecause the broadcast switch computes the route and modifies the routefield in the packet, and can determine which port is the port destinedfor the relative transaction 22.

Said another way, a broadcast switch 20 can connect to the master 12 andslave 22 network interface. The connection travels through the broadcastswitch 20. The broadcast switch 20 is allowed to compute the route andmodify the route field in the transaction and, hence, knows port, forwhich, the transaction is intended. In this way, the ingress port andthe egress ports of the broadcast switch can be decoded, routed, andencoded within the broadcast switch.

Referring to FIG. 1 , FIG. 2 , and FIG. 3 , in accordance with thevarious aspects and embodiments of the invention, a process is shownwherein the broadcast switch 20 performs both the functions of abroadcast adapter and a broadcast switch. The broadcast switch 20 looksup a target path route table using the target identification in arequest packet to retrieve the route to the target front of (down path)the broadcast switch. This route is put in the request packet going outto the target.

The broadcast switch 20 looks up a target path port table using thetarget identification in a request packet to retrieve the egress portnumber of the broadcast switch. The request packet going out of thetarget is sent to the egress port as indicated. Furthermore, thisfeature allows the request packets to go to multiple targets and to besent via the same egress port.

In accordance with the various aspects and embodiments of the invention,the target path route table and the target path port table are set up,in the arrangement shown as one example, such that the broadcast switch20 can broadcast a transaction to multiple destinations (or targets) viathe same egress port. In the arrangement shown, as one example, thisfunctionality is useful when the number of egress ports of the broadcastswitch 20 are less than the number of targets 22.

Subsequently, the broadcast switch 20 looks up a source path route tableusing the source identification in a response packet to retrieve theroute to the source from the broadcast switch 20. In this way, the routeis placed as part of the response packet going out to the source. Withthe target path route table, target path port table, and the source pathroute table, the broadcast switch 20 performs broadcast adapterfunctions and, simultaneously, the routing functions of a plurality ofswitches for the request path and the response paths.

In accordance with the various aspects and embodiments of the invention,as one example, broadcast switch 20 includes an ingress port 26 andegress ports 28. Ingress port 26 is formed of any suitable size, shape,and design and is configured as the pipeline on the request side of thetransaction. Egress port 28 is formed of any suitable size, shape, anddesign and is generally configured as the port or ports through whichthe broadcast switch 20 will send transactions and/or duplicatetransactions through.

In this way, and as discussed herein, the broadcast switch 20 duplicatestransactions received on the ingress port and sends the duplicatedtransactions in the network using the egress ports. Referring to thefigures, as an example, a broadcast switch 20, of FIG. 12 , is shownconfigured with one ingress port 26 and three request egress ports 28.Similarly, the figures also represent (not shown) three response ingressports and one response egress port, to handle a response coming from alltargets connected to the request egress ports in accordance with variousaspects and embodiments of the invention. In the arrangement shown, asone example, various embodiments are shown, various amounts of ingressand egress ports are contemplated for use. Furthermore, the broadcastswitch 20 may perform response aggregation and/or combine all theresponses that correspond to one duplicate and/or more duplicate requestpackets, into a single response packet using a combination function. Thecombined responses may be sent and/or requested through various ports.

In accordance with the various aspects and embodiments of the invention,system 10 and more specifically, the broadcast switch 20 may include abuffer 40. Buffer 40 is formed of any suitable, size, shape and designand is configured to permit independent progress of each egress portwithout having to implement one first in, first out per egress port. Inthis way, the broadcast switch 20 is able to implement a complexbroadcast network while avoiding deadlocks, latency, or drop.

In accordance with the various aspects and embodiments of the invention,system 10 also includes a pipeline, which may also be referred to as apath. Path is formed of any suitable size, shape, and design, and isconfigured as a dynamically situated pipeline for the purpose ofproviding transmission and networking on chip. Path or plurality ofpaths may also simply be known as a pipeline through which the dataand/or data packets, in the arrangement, travels and interacts with thesystem 10 herein. Furthermore, system 10 may include additional switchesand/or broadcast adapters and the like.

As one example, a system and method are presented in this invention forthe purpose of providing transmission broadcasting for a single targetand single broadcast, as well as for a single target on amulti-broadcast, as well as, multi-target on multi-broadcast, andadditionally through tree building utilizing a plurality of broadcastswitches.

In accordance with the various aspects and embodiments of the invention,which is shown in FIG. 1 through FIG. 14 , a first broadcast switchconnects other broadcast switches. In this way, the broadcast switch canperform all the functionality as previously needed for both an adapterand various switches. In this way, the broadcast switch can connectdirectly to and compute the route and modify the route field in thepacket and/or data packet. In this way, the broadcast switch knows whichport is destined for a packet and/or data packet.

In the arrangement shown, system 10 provides these advantages and morein the following way. When a packet and/or data packet is broadcast fromthe master, the broadcast switch looks up a target path route tableusing the target identification in a request packet to retrieve theroute to the target from the broadcast switch. In this way, the routefor the packet is put in the request packet going out to the target(also known as parallel lookup function, which can operate independentlyof transmission order).

Following the lookup, or simultaneously and in parallel, of the targetpath route table, the broadcast switch determines and/or looks up thetarget path port table using the target identification. This is alsodone in a request packet to retrieve the egress port number of thebroadcast switch. In this way, the request packet going out to thetarget is sent to the egress port as indicated in the targetidentification of the target path port table. As one advantage, thisfeature allows the request packets going to multiple targets to be sentvia the same egress port.

As another arrangement, and disclosed herein, the target path routetable and the target path port table can be set up such that thebroadcast switch can broadcast the packet and/or transaction to multipledestinations via the same egress port. This novel feature and novelcapability is advantageous for many reasons. One reason this novelfeature is advantageous is because in some situations, the number ofegress ports of the broadcast switch are less than the number ofdestinations. In this way, the utilization of and efficiency of anegress port greatly appreciates due to the organizationalcharacteristics of the broadcast switch.

Following the lookup, or simultaneously and in parallel, of the targetpath route table and the target path port table, the broadcast switchdetermines and/or looks up the source path route table using the sourceidentification in the response packet. The broadcast switch uses thesource path route table and source identification to determine the routeto the source from the broadcast switch. In this way, the route is putin the response packet going out to the source.

Once the broadcast switch determines the target path route table, thetarget path port table, and the source path route table, the broadcastswitch performs the routing function of a switch for the request pathand the response path.

These and other methods of use and processes are hereby contemplated foruse. For example, parallel processing can provide tremendous speedups.This is important for applications such as deep neural networkscomputations, which can require distribution of the same dataset tomultiple nodes simultaneously. In accordance with some aspects of theinvention, designers of neural network solutions can take advantage ofthe broadcast switches 20 for implementing transaction completion inparallel or simultaneously. For example, various aspects and embodimentsof the invention can be implemented in the field of artificialintelligence computations and deep network accelerators. Whenimplemented in hardware and software, such systems can take fulladvantage of the parallelism of broadcasting using a NoC that includesbroadcast switches 20 and run orders of magnitude faster.

Certain examples have been described herein and it will be noted thatdifferent combinations of different components from different examplesmay be possible. Salient features are presented to better explainexamples; however, it is clear that certain features may be added,modified and/or omitted without modifying the functional aspects ofthese examples as described.

Various examples are methods that use the behavior of either or acombination of machines. Method examples are complete wherever in theworld most constituent steps occur. For example and in accordance withthe various aspects and embodiments of the invention, IP elements orunits include: processors (e.g., CPUs or GPUs), random-access memory(RAM—e.g., off-chip dynamic RAM or DRAM), a network interface for wiredor wireless connections such as ethernet, WiFi, 3G, 4G long-termevolution (LTE), 5G, and other wireless interface standard radios. TheIP may also include various I/O interface devices, as needed fordifferent peripheral devices such as touch screen sensors, geolocationreceivers, microphones, speakers, Bluetooth peripherals, and USBdevices, such as keyboards and mice, among others. By executinginstructions stored in RAM devices processors perform steps of methodsas described herein.

Some examples are one or more non-transitory computer readable mediaarranged to store such instructions for methods described herein.Whatever machine holds non-transitory computer readable media comprisingany of the necessary code may implement an example. Some examples may beimplemented as: physical devices such as semiconductor chips; hardwaredescription language representations of the logical or functionalbehavior of such devices; and one or more non-transitory computerreadable media arranged to store such hardware description languagerepresentations. Descriptions herein reciting principles, aspects, andembodiments encompass both structural and functional equivalentsthereof. Elements described herein as coupled have an effectualrelationship realizable by a direct connection or indirectly with one ormore other intervening elements.

It will be appreciated by those skilled in the art that other variousmodifications could be made to the device without parting from thespirit and scope of this invention (especially various programmablefeatures). All such modifications and changes fall within the scope ofthe claims and are intended to be covered thereby. Furthermore,practitioners skilled in the art will recognize many modifications andvariations. The modifications and variations include any relevantcombination of the disclosed features. Descriptions herein recitingprinciples, aspects, and embodiments encompass both structural andfunctional equivalents thereof. Elements described herein as “couple” or“communicatively coupled” have an effectual relationship realizable by adirection connection or indirect connection, which uses one or moreother intervening elements. Embodiments described herein as“communicating” or “in communication with” another device, module, orelements include any form of communication or link and include aneffectual relationship. For example, a communication link may beestablished using a wired connection, wireless protocols, near-fieldprotocols, or radio frequency identification (RFID).

The scope of the invention, therefore, is not intended to be limited tothe exemplary embodiments shown and described herein. Rather, the scopeand spirit of the invention is embodied by the claims.

What is claimed is:
 1. A broadcast switch for sending a transaction,which is generated by a master, to a plurality of slaves, the broadcastswitch comprising: at least one ingress port for receiving thetransaction from the master; a plurality of egress ports for sendingupdated transactions to the plurality of slaves, wherein each of theplurality of egress ports receives one updated transaction of theupdated transactions; a route computation module for determining routeinformation for each slave of the plurality of slaves that will receivethe transaction; and a transform module that changes a route field ofthe transaction using the computed route information to produce theupdated transactions, wherein the updated transactions are sent to eachof the plurality of slaves through the respective egress port of theplurality of egress ports that is communicatively connected to eachslave that should receive the transaction.
 2. The broadcast switch ofclaim 1, wherein the route computation module performs route tablelookup.
 3. The broadcast switch of claim 1, wherein the routecomputation module performs port table lookup.
 4. The broadcast switchof claim 1, wherein the route computation module performs source pathroute table lookup.
 5. The broadcast switch of claim 1, furthercomprising a response ingress port for receiving a response packet theresponse packet that includes the master's identification.
 6. Thebroadcast switch of claim 1, wherein the updated transaction aretransformed duplicate transactions generated by the transform moduleperforming a function on the transaction and wherein each of thetransformed duplicate transactions are sent through one egress port ofthe plurality of egress ports.
 7. The broadcast switch of claim 1,further comprising a buffer in communication with the ingress port andthe plurality of egress ports, wherein the buffer stores data when thereis backpressure on any egress port of the plurality of egress ports. 8.The broadcast switch of claim 1 further comprising: a plurality ofresponse ingress ports; and at least one response egress port, whereinresponse packets are received, one at each of the plurality of responseingress ports, and the response packets received through the pluralityof response ingress ports are combined to form a combined responsepacket that is sent to the response egress port.
 9. A system forbroadcasting comprising: a master, wherein the master generates at leastone transaction; a plurality of targets, wherein the plurality oftargets receive and service the transaction; and a network-on-chip (NoC)in communication with the master and the plurality of targets, whereinthe NoC includes a plurality of broadcast switches, each of theplurality of broadcast switches having: at least one ingress port; and aplurality of egress ports, wherein at least one broadcast switch of theplurality of broadcast switches is in communication with the master andin communication with a set of the plurality of targets, wherein each ofthe plurality of broadcast switches duplicate the at least onetransaction and each of the plurality of egress ports receive oneduplicated transaction.
 10. The system of claim 9, further comprising atransform module that performs a function on the transaction to generatea plurality of transformed duplicate transactions and each of theplurality of transformed duplicate transactions are sent through oneegress port of the plurality of egress ports.
 11. The system of claim 10further comprising a control module analyzes a control bit of thetransaction and determines to which of the plurality of egress ports thetransaction is duplicated.
 12. A system for broadcasting comprising: amaster, wherein the master generates at least one transaction; aplurality of targets, wherein the plurality of targets receive andservice the transaction; a network-on-chip (NoC) in communication withthe master and the plurality of targets, wherein the NoC includes aplurality of broadcast switches, each of the plurality of broadcastswitches having: at least one ingress port; and a plurality of egressports, a target path route table that includes a plurality of routes; atarget path port table; and a source path route table, wherein at leastone broadcast switch of the plurality of broadcast switches is incommunication with the master and in communication with a set of theplurality of targets and the at least one broadcast switch.
 13. Thesystem of claim 9, wherein the at least one broadcast switch includes abuffer that stores data when there is backpressure on any given egressport of the plurality of egress ports.
 14. The system of claim 13,wherein at least one of the plurality of broadcast switches includes acontrol module that analyzes a control bit of the transaction anddetermines to which of the plurality of egress ports the transaction isduplicated.
 15. The system of claim 9 further comprising a plurality ofresponse ingress ports that receive response transactions and theresponse transactions are combined to form a complete responsetransaction.
 16. The system of claim 9 further comprising at leastresponse egress port for sending a complete response transaction. 17.The system of claim 12, wherein the at least one broadcast switch usesthe at least one transaction and looks up the target path route table toretrieve a route that is added to a request packet going out to atarget.
 18. The system of claim 17, wherein the at least one broadcastswitch looks up the target path port table using target identificationbased on the at least one transaction to retrieve an egress port numberof the at least one broadcast switch.
 19. The system of claim 12,wherein each of the plurality of broadcast switches duplicates the atleast one transaction and multiple duplicated transactions are sent,each one to a different target, using one egress port of the pluralityof egress ports.